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AMENDMENTS TO THE CLAIMS 

This listing of claims will replace all prior versions and listings of claims in the 
application: 

LISTING OF CLAIMS: 

1. (Currently Amended) A method for summarizing a content of an input video sequence, 
said input vid e o s e qu e nc e comprising a plurality of fram e s, said plurality of fram e s b e ing 
group e d into a plurality of vid e o s e gm e nts, said method comprising: 

(a) computing a feature vector for each frame in a set of frames from s e l e cting a 
frame clust e r in said input video sequence which corresponds to a most static on e of said vid e o 
segm e nts ; and 

(b) applying singular value decomposition to a matrix comprised of said feature 
vectors and projecting the matrix on a refined feature space representation, wherein positions of 
said projections on said refined feature space representation represent approximations of visual 
changes in said set of frames from said input video sequence computing a cont e nt valu e in said 
s e l e ct e d fram e clust e r; 

(e) using said comput e d content valu e to cluster r e maining fram e s in said input vid e o 

sequenc e. 

2-6. (Cancelled). 
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7. (Currently Amended) The method of claim l[[6]] 9 wherein said singular value 
decomposition is performed using frames selected with a fixed interval from said input video 
sequence. 

8. (Cancelled). 

9. (Currently Amended) The method of claim J_[[6]], wherein said singular valu e 
decomposition produces a matrix, each column of said matrix represents repr e s e nting a frame in 
said a -refined feature space representation corr e sponding to a fram e in said input video sequ e nc e. 

10-30. (Cancelled). 

31. (Currently Amended) A computer-readable medium containing a program for 
summarizing a content of an input video sequence, said input vid e o s e quenc e comprising a 
plurality of fram e s, said plurality of fram e s b e ing group e d into a plurality of vid e o s e gm e nts, 
said program comprising: 

(a) computing a feature vector for 

each frame in a set of frames from sel e cting a fram e clust e r in said input video sequence-^whieh 
corresponds to a most static video s e gm e nt ; and 

(b) applying singular value decomposition to a matrix comprised of said feature 
vectors and projecting the matrix on a refined feature space representation, wherein positions of 
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said projections on said refined feature space representation represent approximations of visual 
changes in said set of frames from said input video sequence computing cont e nt valu e in said 
sel e ct e d fram e clust e r; 

(e) using said comput e d cont e nt valu e to clust e r r e maining fram e s in said input vid e o 

s e qu e nc e. 

32-36. {Cancelled). 

37. {Currently Amended) The computer-readable medium of claim 3i[[36]]> wherein said 
singular value decomposition is performed using frames selected with a fixed interval from said 
input video sequence. 

38. {Cancelled). 

39. {Currently Amended) The computer-readable medium of claim 31[[36]], wherein said 
singular valu e d e composition produc e s a matrix, each column of said matrix represents 
repr e senting a frame in said a -refined feature space representation corr e sponding to a fram e in 
s aid input vid e o s e quenc e. 

40-72. {Cancelled). 
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73. (New) The method of claim 1, wherein said feature vectors are computed using a 
color histogram that outputs a histogram vector. 

74. (New) The method of claim 73 , wherein said histogram vector is indicative of a 
spatial distribution of colors in said each of said frames. 

75. (New) The method of claim 74, wherein each of said frames is divided into a plurality 
of blocks, each of said plurality of blocks being represented by a histogram in a color space 
indicative of a distribution of colors within each of said blocks. 

76. (New) The method of claim 74, wherein each of said frames is divided into a plurality 
of blocks and said histogram vector comprises a plurality of histograms in a color space, each of 
said plurality of histograms corresponding to one of said plurality of blocks. 

77. (New) The method of claim 1, wherein said method further comprises clustering said 
frames of said input video sequence based upon positions of said projections on said refined 
feature space representation. 

78. (New) The method of claim 77, wherein said method further comprises selecting a 
frame from each cluster to serve as a keyframe in a summarization of said input video sequence. 
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79. (New) The method of claim 78, wherein said selecting a frame comprises locating a 
frame with a feature vector that projects into a singular value that is most representative of other 
singular values of the cluster. 

80. (New) The method of claim 78, wherein said method further comprises using said 
clustered frames to output a motion video representative of a summary of said input video 
sequence. 

81. (New) The method of claim 80, wherein said input video sequence summary is 
composed according to a time-length parameter T/ e/I and a minimum display time parameter T mifJ . 

82. (New) The method of claim 81, wherein the composition of said input video sequence 
summary further comprises: 

locating the video shot 0/ in each cluster S,- having the greatest length; 
determining how the video shots in each cluster will be arranged according to 
C<N = T /en /T M/ll , 

wherein C represents a number of clusters; and 
wherein N represents the maximum number of video shots; 
if C < N, then all the video shots in each cluster is included in said input video sequence 
summary; and 
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if C > N, then sort each video shot 0, from each cluster S/ in descending order by length, 
select the first N video shots for inclusion in said input video sequence summary and assign time 
length T min to each selected video shot. 

83. (New) The method of claim 82, wherein the composition of said input video sequence 
summary further comprises sorting the selected video shots by their respective time codes. 

84. (New) The method of claim 83, wherein the composition of said input video sequence 
summary further comprises extracting a portion of selected video shot equal in length to time 
length T min and inserting each extracted portion in order to said input video sequence summary. 

85. (New) The method of claim 77, wherein said clustering of said frames further 
comprises using a position of the most static shot of said input video sequence to compute a 
value as a threshold during the clustering of said frames. 

86. (New) The method of claim 85, wherein said clustering of said frames further 
comprises computing a content value and using said computed content value to cluster the 
remaining frames by: 

sorting said feature vectors in said refined feature space representation in ascending order 
according to a distance of each of said feature vectors to an origin of said refined feature space 
representation; 
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selecting a vector among said sorted feature vectors which is closest to an origin of said 
refined feature space representation and including said selected feature vector into a first cluster; 

clustering said plurality of sorted feature vectors in said refined feature space 
representation into a plurality of clusters according to a distance between each of said plurality of 
sorted feature vectors and feature vectors in each of said plurality of clusters and an amount of 
information in each of said plurality of clusters. 

87. (New) The method of claim 86, wherein, in said clustering of sorted feature vectors, 
said plurality of sorted feature vectors are clustered into said plurality of clusters such that said 
amount of information in each of said plurality of clusters does not exceed an amount of 
information in said first cluster. 

88. (New) The method of claim 86, wherein said first cluster is composed of frames based 
on a distance variation between said frames and an average distance between frames in said first 
cluster. 

89. (New) The method of claim 86, wherein each of said plurality of clusters is composed 
of frames based on a distance variation between said frames and an average distance between 
frames in said each of said plurality of clusters. 



8 



Atty. Docket No.: CA1055 
PATENT APPLICATION 

AMENDMENT UNDER 37 C.F.R. §1.111 
U.S. Application No.: 09/556,349 

90. (New) The computer-readable medium of claim 31, wherein said feature vectors are 
computed using a color histogram that outputs a histogram vector. 

91. (New) The computer-readable medium of claim 90, wherein said histogram vector is 
indicative of a spatial distribution of colors in said each of said frames. 

92. (New) The computer-readable medium of claim 91, wherein each of said frames is 
divided into a plurality of blocks, each of said plurality of blocks being represented by a 
histogram in a color space indicative of a distribution of colors within each of said blocks. 

93. (New) The computer-readable medium of claim 91, wherein each of said frames is 
divided into a plurality of blocks and said histogram vector comprises a plurality of histograms 
in a color space, each of said plurality of histograms corresponding to one of said plurality of 
blocks. 

94. (New) The computer-readable medium of claim 31, wherein said method further 
comprises clustering said frames of said input video sequence based upon positions of said 
projections on said refined feature space representation. 
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95. (New) The computer-readable medium of claim 94, wherein said method further 
comprises selecting a frame from each cluster to serve as a keyframe in a summarization of said 
input video sequence. 

96. (New) The computer-readable medium of claim 95, wherein said selecting a frame 
comprises locating a frame with a feature vector that projects into a singular value that is most 
representative of other singular values of the cluster. 

97. (New) The computer-readable medium of claim 95, wherein said method further 
comprises using said clustered frames to output a motion video representative of a summary of 
said input video sequence. 

98. (New) The computer-readable medium of claim 97, wherein said input video 
sequence summary is composed according to a time-length parameter T/ e „ and a minimum 
display time parameter T min . 

99. (New) The computer-readable medium of claim 98, wherein the composition of said 
input video sequence summary further comprises: 

locating the video shot 0, in each cluster S, having the greatest length; 
determining how the video shots in each cluster will be arranged according to 
C<N = T / «/r IH/ „, 
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wherein C represents a number of clusters; and 
wherein N represents the maximum number of video shots; 
if C < N, then all the video shots in each cluster is included in said input video sequence 
summary; and 

if C > N, then sort each video shot 0, from each cluster S,- in descending order by length, 
select the first N video shots for inclusion in said input video sequence summary and assign time 
length T min to each selected video shot. 

100. (New) The computer-readable medium of claim 99, wherein the composition of said 
input video sequence summary further comprises sorting the selected video shots by their 
respective time codes. 

101. (New) The computer-readable medium of claim 100, wherein the composition of 
said input video sequence summary further comprises extracting a portion of selected video shot 
equal in length to time length T min and inserting each extracted portion in order to said input 
video sequence summary. 

102. (New) The computer-readable medium of claim 94, wherein said clustering of said 
frames further comprises using a position of the most static shot of said input video sequence to 
compute a value as a threshold during the clustering of said frames. 
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103. (New) The computer-readable medium of claim 102, wherein said clustering of said 
frames further comprises computing a content value and using said computed content value to 
cluster the remaining frames by: 

sorting said feature vectors in said refined feature space representation in ascending order 
according to a distance of each of said feature vectors to an origin of said refined feature space 
representation; 

selecting a vector among said sorted feature vectors which is closest to an origin of said 
refined feature space representation and including said selected feature vector into a first cluster; 

clustering said plurality of sorted feature vectors in said refined feature space 
representation into a plurality of clusters according to a distance between each of said plurality of 
sorted feature vectors and feature vectors in each of said plurality of clusters and an amount of 
information in each of said plurality of clusters. 

104. (New) The computer-readable medium of claim 103, wherein, in said clustering of 
sorted feature vectors, said plurality of sorted feature vectors are clustered into said plurality of 
clusters such that said amount of information in each of said plurality of clusters does not exceed 
an amount of information in said first cluster. 
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105. (New) The computer-readable medium of claim 103, wherein said first cluster is 
composed of frames based on a distance variation between said frames and an average distance 
between frames in said first cluster. 

106. (New) The computer-readable medium of claim 103, wherein each of said plurality 
of clusters is composed of frames based on a distance variation between said frames and an 
average distance between frames in said each of said plurality of clusters. 
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